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those associated with multimedia content servers, e.g., 
bandwidth, CPU, storage, etc.,). The system comprises a 
plurality of server resources, preferably with target multi- 
media content, real time processing engines, etc., and a 
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ries and resource monitors). The resources in any particular 
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the ratio of global to local is specified and controlled by the 
system. The system assigns clients across local and global 
resources and coordinates the placement of replicas of target 
content across global resources. The placement is dynamic 
and performed when necessary based on the analysis of 
utilization patterns of target content and replicas by plurali- 
ties of clients. 
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Note: 

s1 is local storage bin, 
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SELF-REGULATED RESOURCE 
MANAGEMENT OF DISTRIBUTED 
COMPUTER RESOURCES 



BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates generally to techniques for 
managing distributed multimedia resources on wide area 
networks characterized by large bandwidth and resource 
reservation capabilities, and, more specifically, to a system 
that provides the ability to map, reserve, monitor, and 
manage a dynamic plurality of computing and storage 
resources via a collection of arbitrators, policies and direc- 
tories. 

2, Discussion of the Prior Art 

The task of leveraging the increased availability of widely 
distributed content and resources becomes very important 
with the proliferation of the next generation of the Internet, 
e.g., Intemet2. The emerging Internet projects address the 
creation of a leading edge network for a new generation of 
applications that fully exploit the capabilities of broadband 
networks. Very high bandwidth and bandwidth reservation 
will allow materials such as continuous digital video and 
audio to move from research use to much broader use and 
include images, audio, and video in a way currently not 
possible. In such a widely distributed environment, 
accountable, efficient, and self-regulated management of 
resources will be desirable and most importantly, necessary. 

The driving force behind the movement of Internet to the 
next generation is the commercialization of rich multimedia 
content. Digital library collections produced by 
corporations, entertainment material created by movie 
studios, and interactive instructional presentations devel- 
oped by universities are soon to be available over the 
Internet, thus creating a new and broad source of revenue. 

The emerging Internet relies on the bandwidth, which is 
on the order of several magnitudes larger than current 
Internet provides. It also alleviates network resource man- 
agement and QoS control by introducing correspondent 
reservation and monitoring mechanisms. However, it is 
clear, that to date, mechanisms for the collective manage- 
ment of multiple media connections that efficiently leverage 
the sharing of resources across multiple servers in a wide 
area network are not found. 

There is envisioned three major conditions for successful 
commercialization of those newly arising applications: first, 
mechanisms need be provided to allow paying users to 
establish a contract with service providers to reserve 
required infrastructure components and resources at a mutu- 
ally agreed price for which providers establish and support 
a guaranteed quality of service; second, the resources supply 
would have to be sufficient to meet random changes of the 
demand, which may be completely unpredictable during 
architectural studies; and, third, service providers should 
safely rely on the system for effective security, rights and 
royalties management, accounting and billing for the con- 
sumption of dynamically re -configurable distributed virtual 
resources. 

The current focus of resource management in the today's 
Internet, if any, relates to the setup and management of 
individual and independent media connections to server 
resources. However, the danger of this approach becomes 
clear when the presentations reuse multiple primary sources 
of content. To enforce the necessary quality as well as to 
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control the usage and distribution when reusing multiple 
sources of content, two approaches are possible. One 
approach is to copy all content onto a single server (or a 
cluster of servers) during authoring, and rep heating, as 

5 necessary, the final result to as many servers according to 
predicted demand. Primary content providers would then 
establish copyright charges, based on a priori market analy- 
sis. On the positive side, the control of distribution, security, 
and billing functions become much easier, than in case of 

10 distributed content. On the negative, if the demand is 
estimated incorrectly, the profit is not maximized for either 
primary or secondary (i.e., reuse) content providers. Finally, 
the most dangerous problem is that this approach leads to 
over-engineering of resources while it does not prevent 

15 dropout of excessive requests. Such an approach is typical 
for today's Internet, because current multimedia content is 
generally not memory hungry, as compared with emerging 
multimedia applications. 

Another approach would be to reassemble content on a 

20 need basis, for both authoring and dissemination. It would 
allow content to be stored once, but used as many times as 
necessary, establish charges proportionally to content and 
resources usage, and alleviate storage demand. However, it 
requires a system to dynamically manage multiple and 

25 frequently heterogeneous resources. In addition, this 
approach exacerbates security, and resource engineering. A 
demand for the particular segment can not be predicted at 
all, because this segment may be used in completely 
different, even orthogonal applications. Now, if the demand 

30 for a single segment can not be met, multiple applications 
are affected. The latter approach, however, is the only 
sensible way to be used by future Internet, because from the 
resource point of view, it is the most economical, and 
serving a maximum number of users. 

35 Thus, it would be highly desirable to provide a system and 
method that allows all three major commercialization con- 
ditions to be satisfied. 

There are a number of publications and patents in the area 
of QoS-driven resource management. Hie most of the work 

40 has been focused on either the network, as described in U.S. 
Pat. No. 5,388,097 issued Feb. 7, 1995 to Baugher, M. J. et 
aL, and entitled "System and Method for Bandwidth Res- 
ervation for Multimedia Traffic in Communication 
Networks/' and U.S. Pat. No. 5,581,703 issued Dec. 3, 1996 

45 to Baugher, M. J. et al, and entitled "Method and Apparatus 
for Reserving System Resources to assure Quality of Ser- 
vice"; or, the operating system, such as described in the 
reference "An Architecture Towards Efficient Os Support for 
Distributed Multimedia", Proceedings of IS&T/SPIE Mul- 

50 timedia Computing and Networking Conference '96, San 
Jose, Calif., January 1996 by David K. Y. Yau and Simon S. 
Lam. With the proliferation of multimedia services on 
Internet, it was soon realized that while IP networks were 
able to provide a simple, best-effort delivery service, the IP 

55 protocol is not suited for use with new real-time 
applications, such as multimedia streaming, Virtual Reality 
applications, distributed supercomputing. As a result, new 
network protocols, such as Resource Reservation Setup 
Protocol (RSVP) (See, e.g., "The Grid: Blueprint for a New 

60 Computing Infrastructure," Edited by Ian Foster and Carl 
Kesselman, Chapter 19, pp. 379-503, Morgan Kauffman 
Publishers, 1999); Real Time Transport Protocol (RTP); 
Real Time Transport Control Protocol (RTCP) and others, 
were developed (See, e.g., William S tailings, "High-Speed 

65 Networks: TCP/IP and ATM Design Principles", Prentice 
Hall, 1997; and, I. Busse, B. Deffner, and H. Schulzrinne, 
"Dynamic QoS Control of Multimedia Applications based 
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on RTP", Computer Communications, January 1996), Columbia University such as described in the reference 

enabling applications to request and negotiate network QoS entitled "Building Open Programmable Multimedia 

parameters, such as bandwidth and latency. Deployment of Networks", Computer Communications Journal, Vol. 21, 

those protocols on the current Internet has not been No. 8, pp. 758-770, June 1998 by Campbell, A. T., Lazar, A 

successful, firstly because it required upgrading all the 5 A., Schulzrinne, H. And R. Stadler; OMEGA end-point 

non-RSVP routers and servers system software. Secondly, architecture, developed as the interdisciplinary research 

even if RSVP were deployed on the current Internet, very effort in the University of Pennsylvania such as described in 

limited bandwidth and computing resources would still have the reference entitled "Design, Implementation and Experi- 

been the bottleneck for successful deployment of real-time ences of the OMEGA End-Point Architecture", Technical 

applications. The current Internet was built on the backbone, ]Q Report (MS-CIS-95-22), University of Pennsylvania, May 

enabling cross-country communications on relatively 1995 by Nahrstedt K. And J. Smith; in-serv Architecture 

unclogged T3 (45 megabit per second). Proliferation of which is a contribution of the Internet Engineering Task 

graphic pages, and streaming audio and video applications Force (IETF) such as described in the reference entitled "A 

depleted those resources quite fast. Even worse, the rate of Framework for End-to-End QoS Combining RSVP/Intserv 

user's population growth is considerably higher than newly 1$ and Differentiated Services," Internet Draft, IETF, March 

build network resources. 1998 by Bernet Y, et al.; the Quality of Service Architecture 

The National Science Foundation and MCI Corporation, QoS-A, developed by A. Campbell, and presenting an inte- 

responding to the emerging needs of Internet community has grated framework dealing with end-to-end QoS require- 

been building a new network, called vBNS (very -high- ments such as described in the reference entitled "A Quality 

performance Backbone Network Service). This nationwide 20 of Service Architecture", PhD thesis, Lancaster University, 

network also provides a backbone for the two foundations, January 1996 by Andrew T Campbell. Another reference 

university-led effort called Internet 2 and by federal research which analyzes the above mentioned QoS paper is entitled 

agencies, called New Generation Internet. The vBNS allows "A Survey of QoS Architectures", ACM/Springer Verlag, 

most of the connected institutions to run at 622 million bits Multimedia Systems Journal, Special Issue on QoS 

per second (OC12). By the year 2000, vBNS is expected to 25 Architecture, Vol. 6, No. 3, pp. 138-151, May 1998 by 

operate at 2.4 gigabits per second (2,400 megabits per Aurrecoechea, C, Campbell, A. T. and L. Hauw. 

second) by the year 2000. Substantial work has been done by SRI International, 

The vBNS system exploits RSVP protocol to support two developing an End-to-End Resource Management of Dis- 

distinct classes of services: a Reserved Bandwidth Service, tributed Systems (ERDoS), which enables adaptive, end-to- 

i.e. a service with bandwidth commitment, and a traditional 30 end, scalable resource management of distributed systems 

best-effort IP service (See, e.g., Chuck Song, Laura Cun- such as described in the reference ERDOS QoS 

ningham and Rick Wilder, "Quality of Service Development Architecture, Technical Report, SRI International, May 

in the vBNS", MCI Communications Corporation, provided 1998. An extensible Resource Specification Language 

at the URL http://www.vbns.net/presentations/papers/ (RSL) and the resource management architecture has been 

QoSDev/ieeeqos.htm. Still, resource management at the 35 implemented within Globus meta-computing toolkit, and 

network layer for vBNS is done separately from operating used to implement a variety of different resource manage- 

system layer and in isolation from application needs and ment strategies such as described in Czajkowski, K., et al., 

availability of the end -resources, such as storage and com- "A Resource Management Architecture for Metacomputing 

puting resources. Systems" Proc. IPPS/SPDP '98 Workshop on Job Schedul- 

A new breed of high performance applications such as 40 m g Strategies for Parallel Processing, 1998; and Foster, I., 

remote surgery, robotics, tele-instrumentation, automated Kesselman, C, "The Globus Product: A Status Report" Proc. 

crisis response, digital libraries of satellite data, distance IPPS/SPDP '98 Heterogeneous Computing Workshop, pp. 

learning via multimedia supported Web sites, enhanced 4-18, 1998. 

audio, and video, is emerging. However, to accommodate While the architectures described in the above-mentioned 

such high performance applications and their continuous 45 references are directed resource reservation and manage- 

media flows, it is not enough to increase or reserve network ment of end-to-end resources, they generally assume a 

capacity. These new applications require end-to-end single, even geographically limited network subsystem 

resource reservation and admission control, followed by which provides bounds on delay, errors and meet bandwidth 

co-ordination of distributed functions such as: (a) resource demands, and an operating system which is capable of 

scheduling (e.g., CPU, disk, etc.) at the end-system(s), (b) 50 providing run time QoS guarantees. However, the next 

packet scheduling and flow control in the network, and (c) generation Internet must be viewed not as only a network of 

monitoring of the delivered end-to-end quality of service. It networks, but first and foremost a system of distributed 

is essential that quality of service is configurable, predict- systems. In this paradigm, not only the communication 

able and maintainable system-wide, including the end- resources, but also the computing and storage servers are 

system devices, communications subsystem, and networks. 55 shared among many users. 

Furthermore, all end-to-end elements of distributed systems Thus, the architectures mentioned above do not provide a 

architecture must work in unison to achieve the desired coordinated management of overall system resources as a 

application level behavior. function of request activities for individual content and 

Up do date, there has been considerable effort in the computing resources. Consequently, quality of service must 

development of end-to-end quality of service support. 60 be degraded in response to growing volume of requests for 

Among them are Heidelberg QoS Model, developed within such services over and above an established limit. As the 

HeiProject at IBM's European Networking Center and above-mentioned architectures focus on providing QoS as 

described in the reference entitled "HeiRAT — Quality of requested by application, they do not take an advantage of 

Service Management for Distributed Multimedia Systems", a possible aggregation of resources due to commonality 

Multimedia Systems Journal, 1996 by Volg, C, Wolf, L., 65 between user requests for a particular service. For example, 

Herrtwich, R. And H. Wittig; an Extended Integrated Ref- it would be desirable to determine commonality for the 

erence Model (XRM), developed by COMET group at usage history of a particular multimedia content, e.g., bursts 
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of requests within short time intervals, the proximity of clients requesting access to target content, global server-side 

origination addresses of requests, etc. Id addition, the archi- resources are allocated. A plurality of intermediary system 

lectures described above do not allow for dynamic moni- resources (such as directories and resource monitors) enable 

toring and recording of resource consumption for individual the management of such requests for target content in a 

services as well as for groups of related services, with the s proc ess referred to as mapping, brokering and negotiation, 

purpose of calculating cost of service for individual clients. According t0 one aspect of the disc i osur e, the capacity of 

Thus, it would be highly desirable to provide a mecha- bbal resources ^ dynamic and performed when deemed 

nism ^capable ^of providing an adapUve resource management nece Wed OQ ^ ^ ^ Qf utiHzation Uems of 

function for distributed resources that is suited for the next . ♦ » * a r L i r** c f ■ * tl 

generation of Internet. , n lar 8 et «=°^nt and replicas by pluralities of Cents. The 

10 system dynamically shapes capacity by (a) controlling the 

SUMMARY OF THE INVENTION number of replicas associated with an object and (b) the 

It is an object of the present invention to provide a system placement of these replicas, 

and method for managing multimedia content and server- According to another aspect of the invention, the SCP 

side resources that exploits the unique characteristics of additionally controls the mapping of the client requests, 

future computer networked environments. 15 Particularly, methods are provided to dynamically shape 

Particularly, it is an object of the invention to provide a demand for an object based on criteria such as arrival time, 

system and method for managing and controlling the incoming geography, and costs requirements. In particular, 

distribution, sharing and pooling of resources in an Internet/ the present invention characterizes future demand for an 

World Wide Web environment in such a manner that is object based on aggregation and forecasting of past demand 

beneficial, accountable, and seamless to the users who are 20 for such object. This aspect of the invention is described in 

requesting access to multimedia content. conjunction with the capacity shaping mechanism. The SCP 

It is a further objective of the invention to provide a coordinates the dynamic placement of replicas of target 

resource management system that self-regulates, according content across global resources. 

to policies, a plurality of end resources and server-based Still another aspect of the invention relates to manage - 
capabilities. 2 ment of the end-resources. A skilled artisan will appreciate 
The present invention is directed to a self -regulated that today resource management is not feasible in part 
distributed network of system resources, i.e., a "resource of because interfaces to end systems are closed and proprietary, 
resources". Particularly, a distributed system referred to as a According to the system and method of the invention, 
Service Control Plane (SCP) performs self-regulating man- 3Q interfaces and mechanisms are provided that facilitate 
agement of distributed resources comprising: a plurality of resource monitoring, setup, and management, 
end resources such as (but not limited to) servers and To make such resource management a possibility, the SCP 
databases providing access to (multimedia) content and allows an end-resource to be both globally and locally 
services, a plurality of clients requiring access to content and managed with global components managed by the system, 
services, and a plurality of system resources connected by a 35 and local components self-managed by the resource, 
computer networking environment. Herein a resource is referred to as a server-based capa- 
According to the principles of the invention, the service bility. In the present invention, a server is represented in 
control plane is an intelligent intermediary decision plane terms of capabilities which represents a heterogeneous 
for managing and enforcing end-to-end properties in the media unit streaming capacity. Each capability may be 
availability of end resources in a computer networked envi- 40 assigned to local or global management, with the service 
ronment. The service control plane receives stimuli (i.e., control plane permitting the management of global capa- 
application requests) from a plurality of clients desiring to bilities while the server itself manages local capabilities. A 
access target multimedia content, for instance, and responds server administrator allocates the ratio of global and local 
to these with end-to-end mapping recommendations satis- capabilities at a server. Resource capabilities need not be 
fying one or more properties (policies) set in the service 45 uniform. However, capabilities need to be normalized and 
control plane. When a client accepts a mapping consequently, the resource is viewed as a set of storage and 
recommendation, the service control plane guarantees its service bins that represent capabilities for provisioning 
performance as set forth by the policies used to generate it. content. A service bin models capabilities in a resource and 
The SCP essentially provides the control and management comprises one slot for each different capability tuple. A 
of the server-side (end) resources which may be character- 50 server manages a storage bin which is used for each content, 
ized as global, i.e., resources dynamically (on-demand) Content is associated with capabilities via the binding of a 
allocated by the system and having a lifetime determined by storage bin with a service bin. However, because not all 
the system; and local, i.e., resources pre-allocated and content is identical, it is necessary to obtain a worst resource 
presented to the system as possessing a lifetime that is bound for content. Each server is assigned content that is 
independent of the system. As in any distributed object 55 local and/or replicated. Replicated content is globally main- 
scheme, the local resource is a resource that is associated tained while local content is referred to as original content, 
and maintained by its originating private provider. A global Binding a storage bin to a service bin forms a streaming 
resource represents an on-demand created resource that is connection — the binding being evaluated at a quality of 
made available at one or more global system providers. service. The service bin is associated with resource require- 
Particularly, the system administrators control the place- 60 ments needed to stream content in such resolution. Each 
ment of object replicas onto global resources, as well as the content is associated with a resource profile. For constant bit 
ratio of global to local resources for each particular server. rate, this is possible in terms of bandwidth at the expense of 
The placement of replicas, and ratio of global to local computation. For variable bit rate, it is necessary to obtain 
resources may be constant, or changed based on preferences a performance envelope for the worst case. The resource 
of content owners or service provider. 65 profile is used to facilitate the evaluation of the binding. 

Global resources may be made available as a function of Advantageously, the system and method of the invention 

integrated demand from clients. To satisfy the requests of provides important enhancements to resource management 
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architectures not yet available today: 1) it provides a highly routers (e.g. 131, 132, etc.), to a backbone network 100. As 

available distributed decision capability that may be used to a result, all clients and servers are interconnected into a set, 

achieve and enforce properties over the usage, as well as where, abstracting of the security issues, all the clients may 

patterns of usage, of end-resources; 2) it creates a resource share all the servers. FIG. 2 is an illustration depicting a 

monitoring point with respect to the performance of multiple 5 single virtual network 200 interconnecting all the clients 

end resources and clients and their usage patterns so as to ( 121 > U 2 * 123 and I 24 ) and servers (111, 112, 113, and 114). 

provide parameters on where, when, and how to satisfy a For exemplary purposes herein, the network (200) backbone 

request; and 3) it provides an insertion point that can be used resources and its charactenstics, such as bandwidth, latency, 

for dynamic insertion of both resources and content in a etc " are explicitly regulated by an RSVP network, such as 

manner transparent to clients. 10 vBNS > < not shown ) used b y the lnternet2 or New Generation 

Internet (NGI). 

BRIEF DESCRIPTION OF THE DRAWINGS The network resources, dedicated to any end system, like 

a specific client or server, are limited and may vary. For 

Further features, aspects and advantages of the apparatus example, resources may provide extra computing power, 

and methods of the present invention will become better ^ such ^ supe rcomputing arrays, real-time encoders, such as 

understood with regard to the following description, encryption engines, etc., however, for exemplary purposes 

appended claims, and accompanying drawings where: herein> res0 urces include those associated with multimedia 

FIG. 1 illustrates the current Internet as comprising a content servers. It is understood that the solutions provided 

plurality of local networks interconnecting subsets of herein may be applicable to any kind of resource, 

servers, and subsets of clients. 2Q Additionally, end-system resources may be heterogeneous, 

FIG. 2 is an illustration depicting a single virtual network using different operating systems, architecture, processors, 

for interconnecting all the clients and servers of FIG. 1. memory bandwidth, storage, etc. It is assumed that the client 

FIG. 3 is an illustration depicting the provision of repli- resources are sufficient for running single user applications, 

eating links from a server (111) to servers (113) and (114), e *S* mcdia streaming. Such clients may share server 

which must be initiated in advance, to provide continued 25 resources. For example, as shown in FIG. 2, clients (122) 

support of growing demand for objects in the virtual net- and ( 123 > rec l uest s*™* ( m > to Produce an individual 

work of FIG 2 stream (215) and (216), respectively, for each of them, and 

,™ A .„ . . . . o „ ■ ^ . i r>i clients (121) and (122) share server (111) by. requesting 

FIG. 4 illustrates the provision of a Service Control Plane , . 7 t . U M \ j *■ 1 T * i- 

•jji /jAftx j- , • ,u , • , ,j streaming connections (201) and (202), respectively. It ch- 

middleware (400) according to the invention that is located , ,i<y£ a^^a\a • / * * fr ^ 

, 4 ' • c it _ . ™ ents (123) and (124) desire to request content from the server 

between the clients and the server resources of the virtual JU ; , ' , , ^ - n>1 . . . x , 

i c t-i/- i (111) (via dotted lines 203 and 204, respectively) such 

network of FIG. 2. v J ) . , r , ' rt1i r, J \. 

requests may be declined, if the server (111) does not have 

FIG. 5 illustrates servers (111, 112 and 113) as comprising en0U gh of a specific resource, or a combination of resources, 

both local resource storage shaded in dark-gray (501) and In tQe same timej server (113 ) and ( U4 ) may be temporary 

the global resource storage as shaded as light gray (502). ^ underused, having an additional resource to satisfy requests 

FIG. 6 illustrates the SCP (400) as comprising two layers: from clients (123) and (124), but owning no correspondent 

a Service Management Layer and a System Management content. If this content is replicated from the server (HI) to 

Layer according to the invention. servers (113), and (114), then requesting clients (123) and 

FIG. 7(a) illustrates an example replica directory (666) (124) may be re-addressed to servers (113) or (and) (114). 

including schema and data associated with the replica direc- 40 However, two scenarios may govern this: 

tory. First, assuming that the requested content already exists 

FIG. 7(b) illustrates an example server directory (656) on the servers (113) and (114), the system of the invention 

including schema and data associated with the server direc- enables tracking of the "hot" content and where it is located, 

tory. I fl this case > the system automatically re- addresses client 

FIG. 8 illustrates the distribution of object replicas over <s [ et * uests { ° other scrvm with same " hot " content > but 

the territory of the United States, as determined by the having sufficient resources and willingness to accept those 

System Management Layer of the SCP at a particular instant re ques . 

0 f ti m6 Second, if there are no more sufficient available resources 

FIG.9 illustrates a graphical depiction of the capacity ° D * U ( the se, T ers TT/ ^system, the requested 

i . , . J- ♦ a. r j l j- * 50 content may be replicated from the server (111) to the 

shaping mechanism according to the preferred embodiment /ni\ nnA rr^A\ a a -i ui . V * /i-v*\ 

of the invention servers (113) and (114) and made available to clients (123) 

_ A . i and (124) via respective links (310) and (311), as shown in 

FIG. 10 is an illustration depicting the streaming of a first FIG. 3. As media content may be quite large, for example l 

type of content consuming resources si, ml, cl and bl and hour M PEG-2 stream requires on the order of t>l Gbytes of 

a second type of content consuming resources s2,m2,c2 and ss storage> the rcp ii calion may ta ke time substantially longer 

k 2, than clients (123) and (124) agree to wait. Hence, as shown 

FIG. II illustrates an example scenario where a system in FIG. 3, the replicating links (301) and (302) must be 

administrator has configured all server resources onto two initiated in advance, to continue the support of all the 

partitions, one local and one global. growing demand during the time interval that is sufficient for 

DETAILED DESCRIPTION OF THE 60 the / e P lica ^ n of Particular content. 

PREFERRED EMBODIMENT r ™' ~° te "f?^ Intemel streamin 8 applications estab- 
lish individual flow between a client and a server, taking care 

FIG. I is a depiction of the current Internet as comprising of the QoS, security, encoding and all other issues on 

a plurality of local networks (101, 102, etc.) interconnecting per-flow basis. In order to provide means to intervene into 

subsets of servers, (e.g. Ill, 112, etc.) and subsets of clients, 65 this process, a Service Control Plane (SCP) middleware is 

(e.g. 122, 123, etc.), usually positioned in local proximity provided such as shown in FIG. 4. The SCP (400) is located 

from those servers. In turn, local networks are connected via between the clients (I2l, . . . ,124) and the server resources 
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(111, . . . ,114), interrupts client requests, finds their rate, available capacity on web servers by dynamically shaping 

density and proximity, determines what content is "hot" and both demand and capacity based on some criteria established 

predicts the distribution of the requests for such "hot" by the system policies (652). Those policies define set 

content. On the other hand, the SCP (400) monitors the criteria for exploration of the willingness of the servers to 

availability of the resources, maps the requests to the servers s provide services, negotiation policy between overall demand 

with available resources, predicts utilization of the end- and placement of replicas, etc. Further details regarding the 

resources and if necessary, dynamically re-distributes the capacity shaping aspect of the invention may be found in 

content. Because the SCP enables borrowing of under used commonly-owned, co-pending U.S. patent application Ser. 

resources from servers (113) and (114) for increasing the ^W^ffi ^ T^ERNET 

resources of the server (111), for example, the resources at 10 ,™LL T d / w < !L o Sr . .? i 7 

the server 111 grow sufficiently and on-time to satisfy ENVIRONMENT, now U S. Pat No. 6,466,980 the whole 

_ , , . . . ... J contents and disclosure of which is incorporated by refer- 

expeditiously all the requests to the content on this server. An „ a ;e fil „„ frt , tK ■„ J 

tt, cm i ui * i i . .1 ence as if fully set forth herein. 

lne SCP also enables returning resources back to the A u n P c ,. . . . A 

, i ,i . £ i j As shown on MCj. 6, a client (601) instantiates a request 

lending servers when the rate of requests decreases. u« l • ♦ — • i *•* u> * /*t\>*\ c 

° t H (602), which m turn invokes a negotiator object (603) for 

The invention classifies server resources, which are avail- 15 i oca ti n g and negotiating the request with the service object 

able for borrowing, as global, and other resources as local. (604), e.g., streaming content. 

Local resources are those resources that are persistent, i.e., As described in aforementioned herein incorporated 

the content using the local resources is not changed dynami- co-pending U.S. patent application Ser. No. 09/335,272 now 

cally by the system, but only by way of system administra- U.S. Pat. No. 6,463,454, the negotiations are based on user 

tors. Global resources are those resources that are used to 20 profiles, per-flow policies, and server and replica directories, 

dynamically replicate the content, in order to accommodate User profiles (611) characterize the client resources, such as 

the overall volume of requests, or demand. For example, processing power, memory, available bandwidth, etc., and 

FIG. 5 illustrates servers (111, 112 and 113) as comprising exemplify user preferences, such as preferable compression, 

both local resource storage shaded in dark-gray (501) and cost limits, interactivity level, e.g., the possibility to control 

the global resource storage as shaded as light gray (502). It 25 the flow of content in time, previewing set-up, willingness to 

is understood that system administrators are empowered to acce P l 01 re J ect commercial advertisement, etc. The nego- 

designate the ratio of global resource to local resource at a tiator ob J ect ( 603 ) uscs tentative placements, derived from 

server. For example, in FIG. 5, the server (114) is configured both a ' e P lica director y *™ < 666 ) ( see , F * G 7 ( fl » and a 

as completely global, while other servers (111, 112 and 113) server director y service ( 6S ® < see HG - 7 W)> and ?» c » tes 

are partially global, and partially local. 30 * query strategy to query the servers associated with those 

, - , . tentative placements. The resulting placement is provided 
, ^'Su* 8 r fened e ™ bod ! men ' ° fthe nve°tion accordin to lhe exploration and negotiation per-flow poli- 
tic SCP (400) has three major functions: 1) it controls and cks E loration u defines ^ strat how 
shapes the distribution and placement of requests onto shouW fae ded ^ Uel ^ tiation 
servers; 2) it controls and shapes the distribution and place- u ^ ^ tQ refine ^ fc tenWiwe lacements and 
ment of replicas across servers according to some set enabu choosj of a lacemeQt based on criteria inchlding . 
criteria; and, 3) it is responsible for dynamically creating serve[ resource availability e . g a xlyct navi , wim 

destroying, and moving replicas across servers as deemed « »r , j •* i_ • 

7 &> & v uwmvu ness "green factor, and server capacity, e.g., a server having 

necessary. a « n jgh» availab le capacity. The combination of these two 

FIG. 6 illustrates the SCP (400) as essentially comprising 4Q cr iteria makes the risk of service rejection minimal over 

two layers: a Service Management Layer (600a) and a urne , aQ d facilitates the overall load balancing, than in case 

System Management Layer (600*>). Both layers enable a 0 f placement to a "red" and low capacity server. Per-flow 

mapping transitivity of a received client request to an object policies allow customizing placement, and could either be 

replica, but constitute separate services. i oaded dllring initialization, or on-demand. 

Specifically, the Service Management Layer (600a) pro- 45 As mentioned, negotiations involve optional per-flow 

vides management of the per-flow connection by receiving services, including, but not limited to: a security service 

requests for objects and mapping those requests to the (621), a billing and pricing service (622), a transcoding 

particular server(s) based on factors such as: aggregated service (623), and a streaming service (624). For instance, 

demand statistics, and willingness of the servers to provide the security services (621) may be added to provide user 

the requested services. A more detailed description of this 50 authentication and authorization as well as encryption/ 

aspect of the invention is provided in commonly -owned, decryption of data. The pricing service (622) enables media 

co-pending U.S. patent application Ser. No. 09/335,272, services to be offered at various costs and permit a wide 

entitled SYSTEM AND METHOD FOR INTEGRATED variety of pricing policies. The billing service (622) allows 

LOAD DISTRIBUTION AND RESOURCE MANAGE- the cost of the service to be charged to the service user. The 

MENT IN AN INTERNET ENVIRONMENT, now U.S. 5S transcoding service (623) may request a real-time compres- 

Pat. No. 6,463,454 the whole contents and disclosure of sion method, as requested by a profile. The streaming service 

which is incorporated by reference as if fully set forth (624) allows the media servers to conform to the various 

herein. The Service Management Layer (600a) further pro- functions, such as a provision to publish or update media. 

vides a mapping of a request to the object, taking in The security service (621), billing and pricing service (622), 

consideration various parameters, such as user preferences 60 transcoding service (623), and streaming service (624) each 

embodied by user policies (611), client parameters, and also may plug-in to the service management layer (600a) via 

per-flow policies embodied as policies (612), such as a custom facades or interfaces 631, 632, 633 and 634, respec- 

policy for service delivery, including availability, quality, lively. It should be understood that the list of services is not 

interactivity level, etc. exhaustive, and others may be added by implementing 

The System Management Layer (600£>) provides an 65 associated facades. 

aggregated demand and capacity management and control Within the System Management Layer (6006) the service 

by attempting to match predicted demand for web objects to objects (604) undergo another negotiation process (643) 



02/19/2004, EAST Version: 1.4.1 



US 6,516 : 

11 

which is directed to generating and distributing object 
replicas over available servers. As shown in FIG. 6, this 
negotiation process takes into account aggregated demand 
statistics (653), selects the set of "hot", or most popular 
objects according to system policies (652) and server 5 
resource profiles (651). For example, FIG. 8 illustrates the 
distribution of object replicas over the territory of the United 
States, as determined by the System Management Layer 
(600/?) at a particular instant of time. 

As further shown in FIG. 6, the initial placement of ™ 
replicas onto servers (671), (672) and (673) is defined by the 
service provider and (or) content owner, and reflected in the 
replica directory (666) and server directory (656). The server 
brokers (605) reflect the willingness of the servers to accept 
the service, which is monitored and controlled locally by the 35 
server itself, and reported as utilization state indicated as 
Red, Yellow or Green states into the server directory (656) 
(see FIG. 1(b)). For example, if the local server (671) holds 
the replica which is requested by the client (601), and the 
condition of the server (671) is "Green," the Service man- 20 
agement layer may bind the client (601) and the replica on 
the server (671), establishing the streaming flow (680) 
between them. After the binding between the client and a 
server is established, the QoS control, interactivity control, 
etc., is further provided by the point-to-point application 25 
itself. However, if the per-flow QoS parameters for an 
individual request cannot be delivered, then the control 
returns back to the System Management Layer, which 
remaps the request to another server. 

It should be understood that the number of replicas 30 
associated with a given object is variable over time and 
driven by predicted demand for a web object. When the 
Service Management Layer (600a) places the request to the 
System Management Layer (600fc), it may be discovered 
that the demand analysis for the replica of the particular 35 
object and the analysis of availability of capacity may be 
mismatched, e.g., as a result of a decrease or increase of 
demand for the object replica. If the mismatch is of a degree 
in accordance with a set criteria, then the Service Manage- 
ment Layer generates an under/over capacity audits (691) 40 
such as described in co-pending U.S. patent application Ser. 
No. 09/335,273 now U.S. Pat. No. 6,466,980. In response, 
the System Management Layer initiates a comprehensive 
analysis, which could possibly lead to the creation and/or 
deletion of replica(s), effectively shaping capacity based on 45 
demand. 

As further shown in FIG. 6, via server broker objects 
(605) which represent available servers, the negotiator 
objects (643) finally maps the object replica onto one or 5Q 
several global servers, e.g. server (672). Simultaneously, the 
negotiator reports the new placement of the replica to the 
replica directory (656) in the service management layer. The 
mapping of a replica may involve an initiation of replication 
(682), or migration (683), or deletion process. The change in 55 
replica placement is reported back to the directory (656), 
effectively initiating a process (692) for incrementing/ 
decrementing capacity, i.e., adding/deleting the server 
address out of the replica directory (656). 

Note, that optional system services, such as replication 60 
(625), or aggregated cost control (626) may be exploited by 
the negotiator object (643) while deciding on the destination 
of replica and speeding up the replication process. 

FIG. 9 illustrates a graphical depiction of the capacity 
shaping mechanism according to the preferred embodiment 65 
of the invention. Further details of the capacity shaping 
mechanism may be found in above-mentioned co-pending 
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U.S. patent application Ser. No. 09/335,273 now U.S. Pat. 
No. 6,466,980. As shown in FIG. 9, the line (661) represents 
the number of request streams measured per pre-established 
time interval. The dotted line (662) symbolizes the number 
of possible streams per this interval. During the initialization 
process, as indicated at (663) before the time segment T(i), 
no transient (or global) replicas exist. However, during the 
T(i) interval, the demand exceeds capacity to a degree, 
which leads the System Management Layer (600b) to add 
one transient replica to one of the global servers, which has 
declared its willingness to accept the replica, i.e., possess a 
"green" indicator of resource availability. At the end of the 
interval T(i+1) two transient replicas are added, due to the 
predicted high growth in demand, which generated an under 
capacity condition. However, the prediction is not realized 
during several next time intervals, and finally the demand 
falls to a level which generates an over capacity condition. 
If the replica expiration time shown on FIG. 9 is expired at 
the end of the interval T(N-1), then the System Management 
Layer (600/?) does not enable the replicas to extend their life 
on the related servers. As a result, all the transient replicas 
are destroyed, and the overall capacity will fall to the initial 
level. Finally, as shown in FIG. 9, the demand grows fast 
again, so at the end of the control interval T(N) three new 
transient replicas are added. 

According to the invention, server resources may be 
subdivided into two groups of resources: storage and service 
resources. In order to normalize capabilities of the hetero- 
geneous servers, the notion of storage and service bins 
respectively, is introduced. For example, resources required 
by a particular application, e.g., a streaming application, 
may be experimentally or theoretically evaluated by a size of 
storage s, required to store the content, and a tuple (m, c, b), 
where m , c and b symbolize memory, CPU cycle allocation 
(e.g., in million instructions per second (MIPS)), and band- 
width resources that are required to stream the content. 

FIG. 10 illustrates streaming of MPEG-1 content, e.g., 
consuming resources si (1001), ml (1002), cl (1003) and 
bl (1004). Streaming of another type of content, e.g., one 
coded in MPEG-2, may require s2 (1011), m2 (1012), c2 
(1013) and b2 (1014) resources. Thus, as a convenient 
means of measurement of the consumption of resources by 
a various classes of applications, the resources are charac- 
terized as storage s and service r(m,c,b) bins. Consequently, 
the resource is viewed as a set of storage and service bins, 
which represent capabilities for provisioning services. As 
not all applications are identical, it is necessary to obtain a 
worst resource bound for content, requiring experimental 
measurement of different classes of applications for various 
servers. However, this approach allows convenient normal- 
ization and reservation of heterogeneous server resources by 
various classes of applications. 

The relevant disclosure described in greater detail in 
found in commonly-owened, co-pending U.S. patent appli- 
cation Ser. No. 09/335,272 now U.S. Pat. No. 6,463,454 
describes the method of overall resource monitoring for the 
case when several applications run in parallel. However, it is 
assumed that service bins are additive. In other words, the 
overall service resource consumed by several applications 
that are running in parallel is equivalent to a sum of resource 
bins required by each application. The overall storage is 
always a sum of storage bins, unless the applications share 
the same content, and therefore require a common storage 
bin. 

A system administrator configures overall resources as 
local and global. The administrator is responsible for estab- 
lishing the ratio of local to global resources for each server 
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as well as to establishing policies of "green" "yellow" and behind its firewall. The delegate controller performs demand 
"red" load limits for those resources. After configuring a and capacity monitoring tasks for the content provider and 
partition as global storage, the global resource management when critical thresholds are exceeded, the delegate control- 
takes over the control of this resource. Thus, a global storage i er places a replication request to the ISP's controller. The 
bin represents a partition that can only be reserved by the s isp controller then arbitrates between pending replication 
global resource management provided by SCR Note, that the rcqucsts and detcrm ines which replication requests get allo- 
system administrator or the server itself, depending on a cated t0 which lo5al serveKf In particulaj . ±t same global 
relevant policy, may re-c aim the global resource in full or scrvcr bc sharcd acrQSS diffefcnt dd controllerSi 
partially, by requesting its release from the SCP manage- ' . . 
ment. The native reservation management process (1020) as A skilled artisan will appreciate that content providers, for 
shown in FIG. 10, is responsible for monitoring resource 10 security reasons, may not be willing to disclose nor allow 
consumption, and determining the server willingness to another party to access its internal usage statistics. For this 
accept new requests. It is understood that, the native process reason, content providers may be enabled to directly request 
may discriminate between application requests for global tne placement of a specific replica onto the ISP global 
and local content. Additionally, a request for placement of a servers. Furthermore, such request may be conditioned (by 
global replica may be declined or accepted, as controlled by 15 the content provider as opposed by the controller) to meet 
a pre-installed policy that enables maximization of the cost particular requirements such as geography or capacity of the 
of global resources that may be billed to a global resource tar g et S lobal server. The ISP controller may then try to locate 
requester. Such a policy may also depend on request statis- one su ch particular global server satisfying such require- 
tics monitored by the SCP. ments. For example, a particular content provider may place 
As mentioned, local storage resources are dedicated to 20 a replication request such as "get me a replica of the 
storing persistent replicas of content, while transient replicas following content within the next 5 minutes and locate such 
are replicated using the global storage. Further, the local transient replica in a HIGH capacity server in the Southeast 
storage bin is always associated with one or more local °^ trie USA' . 

service bins, and a global storage bin is associated with one Moreover, according to the principles of the invention, 

or more global service bins. 25 ISPs may collaborate in agreements allowing an ISP to 

FIG. 11 illustrates an example scenario where a system handoff replication requests to other ISPs having suitable 

administrator has configured all server resources onto two global servers. For example, in the event that none of an 

partitions, one local and one global. As shown in the ISP ' S global servers satisfies the requirements associated 

example scenario depicted in FIG. 11, two streaming appli- 30 with a request placed by a content provider, the ISP would 

cations share a transient content replica and consume one then handoff the replication request to some other friendly 

global storage bin s2 (1102) and two service bins r2 (1103) ISP. 

and r3 (1104), while one application consumes a persistent Moreover, according to the principles of the invention, an 

content replica included in service storage bin si (1101). auction for the placement of such replication request may be 

Overall global storage resource consumed is thus equivalent 35 implemented by an intermediary party. Such intermediary 

to s2, while an overall global service resource is a sum of party may then negotiate the handoff such replication 

bins r2 and r3. request to the most capable or suitable party according to 

There are many useful applications of the present inven- some cost metric such as cost or an expiration deadline, 

tion. First, the current QoS approach of many QoS manage- A skilled artisan will appreciate that it is possible for a 

ment systems suggests decreasing of quality, if such an event 40 replica to be placed in a global server, which does no longer 

occurs. Contrary to this, according to the invention, a fit demand characteristics. To this end, the present invention 

customer will receive the quality which he/she agrees to pay augments the replica management system described in 

for. Customers will most probably refuse to buy the content related U.S. patent application Sen No. 09/335,273 now 

without the ironclad service guarantee, and may even dis- U.S. Pat. No. 6,466,980 by introducing of periodical sanity 

agree with the reduction of quality even if the charges for the 45 checks, i.e., implementation of a process to review and 

service were reduced. The invention demonstrates how to enhance the overall placement of transient replicas on the 

provide a mutually agreed quality of service between the system, which is referred to herein referred to as replica 

client and the system with a high level of service guarantees. migration. 

Additionally, the present invention may be used as a [ n conjunction with the SCP, a final aspect of the inven- 

value-added service by a Internet Service Providers (ISP) to 50 tion is directed to a Resource Management Framework 

the content providers. It allows to support a broadcast (RMF) which is an extensible, distributed, policy-based, 

content providers (such as, for illustrative purposes only, a object-oriented management framework that supports per- 

cable TV News Network "NN") by dynamically matching flow media setup and contro i between one or more 

demand to capacity of NN's servers. When necessary, the med i a servers and end users and may augment the SCP. This 

ISP would place transient replicas (of NN content) on its 55 extensible framework is described in greater detail in 

own global servers based on characteristics about the commonly-owned, co-pending U.S. patent application Sen 

demand for NN content as presented to the ISP. No. 09/335,275 which is currently pending, entitled 

The present invention may additionally be used to provide POLICY-BASED MULTIVARIATE APPLICATION- 

statistical sharing of ISP resources (i.e., globally-shared LEVEL QoS NEGOTIATION FOR MULTIMEDIA 

servers) when supporting multiple broadcast content pro- 6 o SERVICES, the whole contents and disclosure of which is 

viders (such as for illustrative purposes only television and incorporated by reference as if fully set forth herein. The 

media networks). This way, the ISP controller may manage RMF framework comprises a mandatory core set of basic 

the allocation of replication requests according to some cost functions upon which further features can be added. Such 

metric such as profit or content provider maximum loss new extended functional compositions for the framework 

protecting most likely, the best interest of the ISP. 65 include the addition of user profiles and preferences, security 

In another implementation, each independent content (authentication and authorization), pricing and billing, 

provider runs a server referred to as a delegate controller reservations, resource monitoring and others. In addition, 
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the policies, upon which management decisions and service 
selections are based, are >pluggable=and new or updated 
policies may be added, to augment or replace existing policy 
and allow new behavior or refinement of existing behavior. 

The RMF is designed to augment the SCP to facilitate the 
management of the heterogeneous media streaming systems. 
The low-level network connections, streaming and media- 
control remain the domain of the media streaming systems. 
RMF hooks into those lower levels and provides a higher 
level management and control of those services. For 
example, the RMF includes QoS mapping down to media 
level services based on application level requests, but defers 
the setup of the network connections to the SCP. Where 
media streaming may be adaptively rate controlled and 
managed by the media server, within the bounds permitted 
by the QoS specification, then the media server can act 
autonomously. When it is unable to maintain the required 
QoS level then an exception is raised to the SCP, which may 
then, according to contact policies involved, attempt to 
expand or migrate the end-resources, and then re-map the 
clients requests, allowing them to receive services else- 
where. 

While the invention has been particularly shown and 
described with respect to illustrative and preformed embodi- 
ments thereof, it will be understood by those skilled in the 
art that the foregoing and other changes in form and details 
may be made therein without departing from the spirit and 
scope of the invention which should be limited only by the 
scope of the appended claims. 

Having thus described our invention, what we claim as 
new, and desire to secure by letters patent is: 

1. A system for regulating server resources in a distributed 
wide area network comprising servers interconnected by a 
computer networking environment, said resources including 
multimedia object resources and resources for storing and 
streaming said multimedia objects to requesting clients, the 
system comprising; 

first service management means implementing objects for 
receiving client requests for streaming object resources 
from a server to said client, generating statistics asso- 
ciated with demand for said object resources, and 
automatically mapping said requests to available serv- 
ers in said network based on predetermined criteria, 
including user profiles for characterizing client 
resources and user preferences relating to receipt of 45 
said object resource, and per- flow services relating to 
one or more selected from the group comprising, 
transcoding object content, offered pricing for said 
content, billing, authorizing and authenticating users, 
and encrypting/decrypting data content, and wherein 
said first management means further includes one or 
more first negotiator objects for locating a service 
object in said distributed network capable of handling 
said request according to said predetermined criteria, 
generating and placing inquiries related to said received 
requests, and negotiating requests with a service object; 
and, 

second system management means implementing objects 
for automatically shaping capacity of object resources 
at said servers in said distributed network based on said 
demand statistics, current available capacity of said 
servers and a willingness factor associated with each 
server for providing said requested resource, said sec- 
ond management including means for automatically 
replicating an object associated with said objects 
resources and temporarily placing said object replicas 
at one or more server devices having available capacity 
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according to said predetermined criteria, wherein per- 
formance for placing requests of said first management 
means has an input of and considers said capacity 
shaping provided by said second management means. 

2. The system as claimed in claim 1, wherein said client 
resources includes one or more of processing power, 
memory, an available bandwidth for receiving said object 
content at a client resource. 

3. The system as claimed in claim 1, wherein said user 
preferences includes preferable compression of said content, 
cost limitations for said content, and interactivity level 
including real-time interaction with flow of said object 
resource content. 

4. The system as claimed in claim 1, wherein said criteria 
includes willingness of said servers to provide said 
requested services, and available capacity of said server for 
providing a streaming resource. 

5. The system as claimed in claim 1, further including 
facade means for interfacing per-flow services with said first 
management means, said facade means enabling dynamic 
insertion of both resources and content in a manner trans- 
parent to clients. 

6. A system for regulating server resources in a distributed 
wide area network comprising servers interconnected by a 
computer networking environment, said resources including 
multimedia object resources and resources for storing and 
streaming said multimedia objects to requesting clients, the 
system comprising: 

first service management means implementing objects for 
receiving client requests for streaming object resources 
from a server to said client, generating statistics asso- 
ciated with demand for said object resources, and 
automatically mapping said requests to available serv- 
ers in said network based on predetermined criteria, 
wherein said first management means further includes 
one or more first negotiator objects for locating a 
service object in said distributed network capable of 
handling said request according to said predetermined 
criteria, generating and placing inquiries related to said 
received requests, and negotiating requests with a ser- 
vice object, wherein first negotiator objects invoke a 
first directory service means for locating said service 
object, said first directory service means comprising a 
mapping of service objects to locations of associated 
object replicas at said servers, and said first negotiator 
objects invoke a second directory service for mapping 
available distributed servers with an available capacity 
rating, wherein said second directory service further 
maps a willingness indicator to an associated server, 
said willingness indicator indicating degree of willing- 
ness of a server to receive placement inquiries; 

second system management means implementing objects 
for automatically shaping capacity of object resources 
at said servers in said distributed network based on said 
demand statistics, current available capacity of said 
servers and a willingness factor associated with each 
server for proving said requested resource, said second 
management including means for automatically repli- 
cating an object associated with said objects resources 
and temporarily placing said object replicas at one or 
more server devices having available capacity accord- 
ing to said predetermined criteria, wherein performance 
for placing requests of said first management means has 
an input of and considers said capacity shaping pro- 
vided by said second management means wherein said 
second management means includes one or more ser- 
vice broker objects representing available servers in 
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said network, said service broker objects interacting generating and placing inquiries related to said received 

with said second negotiator objects for placing gener- requests, and 
ated object replicas associated with an object resource 

to said global servers throughout said distributed net- negotiating requests with a service object, which step of 

work. S negotiating requests includes the step of negotiating for 

7. The system as claimed in claim 6, wherein said second per-flow services and implementing one or more 

management means includes means for generating object facades for dynamically inserting optional including 

replicas associated with object resources for placement at both resources and content in a manner transparent to 

global servers throughout said distributed network, said clients, said per-flow services including transcoding 

second management means further including one or more ao object contentT offered pricing for said bim 

second negotiator objects for distributing object replicas 4 , ■ ■ A iU *• *■ j / 

authorizing and authenticating users, and encrypting/ 

over available servers. a • a 

8. The system as claimed in claim 7, wherein said second decrypting data content; 

management means generates object replicas on demand. automatically shaping capacity of object resources at said 

9. The system as claimed in claim 7, wherein a number of 15 servers in said distributed network based on said 
generated object replicas associated with an object resource demand statistics, current available capacity of said 
is determined in accordance with predicted demand for a servers and a willingness factor associated with each 

given object, server capable of providing said requested resource; 

10. The system as claimed in claim 1, wherein said servers 

in said network are normalized as comprising a storage bin 2 o automatic ally replicating an object associated with said 

of a size for storing object resource content, said second objects resources and temporarily placing said object 

management means managing placement of content across replicas at one or more server devices having available 

storage bins so that access optimizations may be made. capacity according to said predetermined criteria, 

11. The system as claimed in claim 10, wherein said wherein performance for placing requests for streaming 
servers in said network are normalized by representing a 25 resources has an input of and considers said 
server as comprising an associated service bin characterized capacity shaping step; and 

as a tuple comprising memory, CPU allocation and stream- 
ing bandwidth required for provisioning services. characterizing client resources according to user profiles 

12. The system as claimed in claim 10, wherein a object and user preferences relating to receipt of said object 
resource streaming connection between a server and a client 30 resource, and implementing said user profiles and user 
is instantiated by binding a storage bin of an associated preferences as said predetermined criteria. 

server to its service bin while taking into account one or method as claimed in claim 18, wherein said step 

more quality of service parameters, of locatiQg a service object includes referencing a first 

13. The system as claimed in claim 12, wherein a stream- j- * r * i.- * * 1 r 

i . . directory service for mapp ins service obiects to locations of 

ing connection is instantiated by binding a storage bin of an 35 , . , . 4 

• . j , . . • /■ . -i , * ■ ■ * associated obi ect replicas at said servers, 

associated server to its service bin while taking into account J F 

one or more quality of service parameters. 20 - ^ method as claimed in claim 19, further including 

14. The system as claimed in claim 10, wherein said referencing a second directory service for mapping available 
second management means manages placement of content distributed servers with an available capacity rating, 
across storage bins according to geography of demand. 40 21. The method as claimed in claim 20, further including 

15. The system as claimed in claim 14, wherein said referencing a second directory service for mapping available 
second management means manages placement of content distributed servers with a willingness indicator indicating a 
rrnTnd 510 '^ ^ " a * * degree ° f willin S ness of a server t0 receive placement 

16. The system as claimed in claim 14, further including 45 lDC l uines ' 

a signaling protocol clients and content servers enabling 22 * ^ method as claimed in claim 18, wherein said step 

control servers to monitor the placement of requests from °f automatically replicating an object associated with said 

clients onto said servers. objects resources and temporarily placing said object repli- 

17. The system as claimed in claim 1, wherein the wide cas at one or more server devices includes implementing or 
area network comprises the Internet. 50 m0 re second negotiator objects for distributing object rep- 

18. A method for regulating server resources in a distrib- ji cas over available servers. 

uted wide area network comprising servers interconnected ^ ^ method as daimed in c]aim lg wherein said 

by a computer networking environment, said resources ~ . A A . A . • . j , i ^ • . 

* i a- i, 1# * j- . • t . c of generating statistics associated with demand for said 

including multimedia object resources and resources for . & 

storing and streaming said multimedia objects to requesting 55 object resources ^eludes: 

clients, the method comprising: predicting a demand for a given object resource; and, 

providing objects for receiving client requests for stream- determining a number of generated object replicas asso- 

ing object resources from a server to said client, gen- ciated ^ gaid ^ demand 

erating statistics associated with demand for said object ^ _ , J , . ,. f . „ 

resources, and automatically mapping said requests to 60 24 ^ method 35 claimed m claim 22 ' ^ nher including 

available servers in said network based on predeter- providing one or more service broker objects representing 

mined criteria, wherein said step of automatically map- available servers in said network for interaction with said 

ping requests further includes steps of, second negotiator objects to place generated object replicas 

implementing one or more first negotiator objects for associated with an object resource to said global servers 

locating a service object in said distributed network 65 throughout said distributed network, 

capable of handling said request according to said 25. The method as claimed in claim 18, further including 

predetermined criteria, characterizing said servers in said network as comprising a 
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storage bin of a size for storing object resource content, and 
managing placement of content across storage bins. 

26. The method as claimed in claim 25, further including 
characterizing said servers in said network as comprising an 
associated service bin including a tuple comprising memory, 5 
CPU allocation and streaming bandwidth required for pro- 
visioning services. 

27, The method as claimed in claim 26, further including 
the step of instantiating an object resource streaming con- 
nection between a server and a client by binding a storage 



bin of an associated server to its service bin while taking into 
account one or more quality of service parameters. 

28. The method as claims in claim 18, further including 
implementing a signaling protocol between clients and con- 
tent servers for monitoring the placement of requests from 
clients onto said servers. 

29. The method as claimed in claim 18, wherein the wide 
area network comprises the Internet. 
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